NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Soft convex quantization: revisiting Vector Quantization with convex optimization

Gautam, Tanmay; Pryzant, Reid; Yang, Ziyi; Zhu, Chenguang; Sojoudi, Somayeh (October 2024, PMLR)

Full Text Available
Auto-Instruct: Automatic Instruction Generation and Ranking for Black-Box Language Models

https://doi.org/10.18653/v1/2023.findings-emnlp.659

Zhang, Zhihan; Wang, Shuohang; Yu, Wenhao; Xu, Yichong; Iter, Dan; Zeng, Qingkai; Liu, Yang; Zhu, Chenguang; Jiang, Meng (January 2023, EMNLP)

Full Text Available
Generate rather than Retrieve: Large Language Models are Strong Context Generators

Yu, Wenhao; Iter, Dan; Wang, Shuohang; Xu, Yichong; Ju, Mingxuan; Sanyal, S.; Zhu, Chenguang; Zeng, Michael; Jiang, Meng (January 2023, International Conference on Learning Representations)

Full Text Available
Empowering Language Models with Knowledge Graph Reasoning for Question Answering

Hu, Ziniu; Xu, Yichong; Yu, Wenhao; Wang, Shuohang; Yang, Ziyi; Zhu, Chenguang; Chang, Kai-Wei; Sun, Yizhou (December 2022, Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing)

Answering open-domain questions requires world knowledge about in-context entities. As pre-trained Language Models (LMs) lack the power to store all required knowledge, external knowledge sources, such as knowledge graphs, are often used to augment LMs. In this work, we propose knOwledge REasOning empowered Language Model (OREOLM), which consists of a novel Knowledge Interaction Layer that can be flexibly plugged into existing Transformer-based LMs to interact with a differentiable Knowledge Graph Reasoning module collaboratively. In this way, LM guides KG to walk towards the desired answer, while the retrieved knowledge improves LM. By adopting OREOLM to RoBERTa and T5, we show significant performance gain, achieving state-of-art results in the Closed-Book setting. The performance enhancement is mainly from the KG reasoning’s capacity to infer missing relational facts. In addition, OREOLM provides reasoning paths as rationales to interpret the model’s decision.
more » « less
Full Text Available
The Shifted and The Overlooked: A Task-oriented Investigation of User-GPT Interactions

https://doi.org/10.18653/v1/2023.emnlp-main.146

Ouyang, Siru; Wang, Shuohang; Liu, Yang; Zhong, Ming; Jiao, Yizhu; Iter, Dan; Pryzant, Reid; Zhu, Chenguang; Ji, Heng; Han, Jiawei (January 2023, Association for Computational Linguistics)

Full Text Available
A Unified Encoder-Decoder Framework with Entity Memory

https://doi.org/10.18653/v1/2022.emnlp-main.43

Zhang, Zhihan; Yu, Wenhao; Zhu, Chenguang; Jiang, Meng (January 2022, EMNLP)

Full Text Available
A Unified Encoder-Decoder Framework with Entity Memory

Zhang, Zhihan; Yu, Wenhao; Zhu, Chenguang; Jiang, Meng (January 2022, Empirical Methods on Natural Language Processing)

Full Text Available
Diversifying Content Generation for Commonsense Reasoning with Mixture of Knowledge Graph Experts

https://doi.org/10.18653/v1/2022.findings-acl.149

Yu, Wenhao; Zhu, Chenguang; Qin, Lianhui; Zhang, Zhihan; Zhao, Tong; Jiang, Meng (January 2022, Findings of the Association for Computational Linguistics: ACL 2022)

Generative commonsense reasoning (GCR) in natural language is to reason about the commonsense while generating coherent text. Recent years have seen a surge of interest in improving the generation quality of commonsense reasoning tasks. Nevertheless, these approaches have seldom investigated diversity in the GCR tasks, which aims to generate alternative explanations for a real-world situation or predict all possible outcomes. Diversifying GCR is challenging as it expects to generate multiple outputs that are not only semantically different but also grounded in commonsense knowledge. In this paper, we propose MoKGE, a novel method that diversifies the generative reasoning by a mixture of expert (MoE) strategy on commonsense knowledge graphs (KG). A set of knowledge experts seek diverse reasoning on KG to encourage various generation outputs. Empirical experiments demonstrated that MoKGE can significantly improve the diversity while achieving on par performance on accuracy on two GCR benchmarks, based on both automatic and human evaluations.
more » « less
Full Text Available
Restoring the Executability of Jupyter Notebooks by Automatic Upgrade of Deprecated APIs

https://doi.org/10.1109/ASE51524.2021.9678889

Zhu, Chenguang; Saha, Ripon K; Prasad, Mukul R; Khurshid, Sarfraz (November 2021, IEEE/ACM International Conference on Automated Software Engineering (ASE))

Full Text Available
A Survey of Knowledge-Enhanced Text Generation

https://doi.org/10.1145/3512467

Yu, Wenhao; Zhu, Chenguang; Li, Zaitang; Hu, Zhiting; Wang, Qingyun; Ji, Heng; Jiang, Meng (January 2022, ACM Computing Surveys)

The goal of text-to-text generation is to make machines express like a human in many applications such as conversation, summarization, and translation. It is one of the most important yet challenging tasks in natural language processing (NLP). Various neural encoder-decoder models have been proposed to achieve the goal by learning to map input text to output text. However, the input text alone often provides limited knowledge to generate the desired output, so the performance of text generation is still far from satisfaction in many real-world scenarios. To address this issue, researchers have considered incorporating (i) internal knowledge embedded in the input text and (ii) external knowledge from outside sources such as knowledge base and knowledge graph into the text generation system. This research topic is known as knowledge-enhanced text generation. In this survey, we present a comprehensive review of the research on this topic over the past five years. The main content includes two parts: (i) general methods and architectures for integrating knowledge into text generation; (ii) specific techniques and applications according to different forms of knowledge data. This survey can have broad audiences, researchers and practitioners, in academia and industry.
more » « less
Full Text Available

« Prev Next »

Search for: All records